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Social network structure is very important for understanding human information diffusing, coop- 
erating and competing patterns, ft can bring us with some deep insights about how people affect 
each other. As a part of complex networks, social networks have been studied extensively. Many im- 
portant universal properties with which we are quite familiar have been recovered, such as scale free 
degree distribution, small world, community structure, self-similarity and navigability. According to 
some empirical investigations, we conclude that our social network also possesses another important 
universal property. The spatial structure of social network is scale invariable. The distribution of 
geographic distance between friendship is about Pr(d) oc gT 1 which is harmonious with navigability. 
More importantly, from the perspective of searching information, this kind of property can benefit 
individuals most. 

PACS numbers: 89.75.Hc, 87.23.Ge, 89.20.Hh, 05.10.-a 



What does our social network structure look like? 
How does the structure benefit us? Understanding the 
structure of the social network which has been weav- 
ing by us and we live in is a very interesting problem. 
As a part of the recent surge of interest in networks, 
there has been many researches about social network 
[3, S, i 1, 1, 1, 0, S ©, M, ED, El 0, Q El • Social net- 
work is a typically complex network. It possesses some 
familiar properties such as small- world , scale free [|[ , 
community structure [1] and self-similarity [f| @ . More 
interesting, social network has a special property of navi- 
gability |12u ] . The navigable property of social network 
has become the subject of both experimental and theoret- 
ical researchB 1, M, m, [H, H, M M, 01 M, H HI HH? 

]. Recently, Liben-Nowell et al. explored the role of ge- 
ography alone in routing within a large, online social net- 
work. They used data from about 500 thousand members 
of the LiveJournal online community, who made avail- 
able their state and city of residence, as well as a list of 
other LiveJournal friends. Message- forwarding simula- 
tions based on these data showed that a routing strategy 
based solely on geography could successfully find short 
chains in the network. They also found that the density 
function Pr(d) of geographic distance d between friend- 
ship is Pr(d) oc d _1 . This result seems contradicted with 
Kleinberg's theoretic results Q, which means our social 
network is not navigable. Liben-Nowell et al. argued that 
this kind of contradiction is caused by the nonuniform 
population density, then they presented a new model to 
explain navigable property of social network. Almost at 
the same time, however Lada Adamic and Eytan Ada also 
found the same phenomena [2(J. They investigated a rel- 
atively small social network, the HP email network. The 
email network is based on HP buildings. Lada Adamic 
and Eytan Ada also cannot explain the contradiction 
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well. They thought it is caused by the limiting geom- 
etry of the buildings. But more recently, R. Lambiotte 
et al. investigated a large mobile phone communication 
network [9(. The network consists of 2.5 million mobile 
phone customers that have placed 810 million communi- 
cations and for whom they have geographical home lo- 
calization information. Their empirical result shows that 
the mobile phone communication network is correspond- 
ing to Kleinberg's theory. Do Lada Admaic, Eytan Ada, 
Liben-Nowell and R. Lambiotte et al. show us a univer- 
sal phenomenon or just a coincidence? In this letter we 
will show that with the distribution of geographic dis- 
tance between friendship is Pr(d) oc d , our social net- 
work is navigable, even the population density is nonuni- 
form or some geometry limiting. This kind of distribu- 
tion is also harmonic with Kleingber's theory when the 
density of population is uniform. So we think this kind 
of scale invariant distribution of geographic distance be- 
tween friendships is universal. 

Why does the spatial structure of our social networks 
possess the property and how does this distribution ben- 
efit us? Even the scaling law in the spatial structure 
makes the social network navigable. We do not think 
to let the individuals sending message efficiently is the 
right answer. In the following two sections we will firstly 
conclude that our social network possess the property of 
Pr(d) oc d^ 1 and then we will give the answer to the 
above two questions from the perspective of optimal col- 
lecting information. 



I. SPATIAL STRUCTURE AND NAVIGABILITY 

According to the facts mentioned above, we use a scale 
invariant friendship network (SIF for a short) [2^| to 
model real social network, even when the population den- 
sity is nonuniform. Like Kleinberg's model (K for a short) 
[lfj |. we also employ a lattice as the ground regular net- 
work in which each node possess a weight (population 
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density). Each node u has a short-range connection to all 
nodes within p lattice steps, and q long-range connections 
generated independently from a distribution Pr(d) ~ d a 
(density function). In order to keep model simple and 
not to lose any generality we always set q = p = 1. 
For each long-range connection of u, we first randomly 
choose a distance d according to the above power law dis- 
tribution. Then randomly choose a node v (proportional 
to v's weight) from the node set in which the distance 
for u to each element is d. At last, generate a directed 
long-range connection from u to v. When population 
density is nonuniform, compared with K model, SIF al- 
ways keeps the distribution of geographic distance be- 
tween friendship scale invariant in any situations. When 
the population density is uniform, the probability that 
node u chooses node v as its long-range contact in SIF is 

Prsi F {u,v,a ) = — r— 2 (1 

where c(u,v) = \{x\d(u,x) — d(u,v),x £ S}\, S is the 
set of all nodes in SIF network and 

Pr K (u,v,P) = Y^dJ^xf (2) 
in K network [io| . We have 

Pr K (u,v,-k) _ 

Pr S i F (u,v,-l) { 1 

for fc-dimensional lattice based network. It implies that 
SIF network with a = — 1 corresponds to the result in 
K network with (3 — — fc when the population density 
is uniform. Here, we should note that in fc-dimensional 
based lattice, if node u connect to node v with probability 
proportional to d(u,v)P, it does not equal Pr{d) oc d@ , 
but Pr(d) oc d k+ P~ l . From the above discussion, we 
know that Kleinberg's result is not contradicted with the 
empirical results but well correspond to them. 

We also can prove that, our social network is navigable 
just according to the distribution of geographic distance 
between friendship is Pr(d) oc d~ l . The expectation of 
decentralized search is at most 0(log 4 n) for nonuniform 
population density. Here, we focus on the 2-dimensional 
lattice, and the analysis can be applied to fc-dimensional 
lattice networks. We can easily make the following two 
assumptions (1) In each small enough region, the popu- 
lation density is uniform. (2) The maximum population 
of all small region are M and minimum population is 
to > 0. Under the two assumptions, easily we have 

M m 
c—d- 1 < Pr(d) < c—d- 1 (4) 

TO M 

in K network with /3 = — 2, where c is a constant. 

Starting from the analysis of time complexity of navi- 
gation, we compare the following two routing strategies. 
Strategy A, the message that navigates to target t di- 
rectly (the original Kleinberg's greedy routing). Strat- 
egy B, the message firstly navigates to node j, then from 
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FIG. 1: The spatial structure in mobile communication net- 
work. From it we can see that Pr(d) and -j have a roughly 
linear relationship. You can get more detail about the data 
from the article [5j]. 

node j navigates to target t. Obviously, the expectant 
steps spent by Strategy B is not less than Strategy A 
for any node j. Thus we have the expectant steps spent 
on navigation in any small region is at most 0(log 2 n). 
Each small region can be regarded as a node, then we get 
a new 2-dimensional lattice in which each node's weight 
(population) is between M and to. According to the ar- 
ticle [l9j], we have the expectant steps spent among the 
squares is at most 0(|f log 2 n). Thus, the upper bound 
of navigation in nonuniform lattice is 0(-^log 4 n). So, 
with the above spatial structure property, our social net- 
work is navigable. 

From Eq. 01 we can see that if the difference of popu- 
lation density among different areas are not too big, SIF 
with a = — 1 and K model with (3 = — 2 for 2-dimensional 
situation have no essential difference. R. Lambiotte pro- 
vided us the data freely. From their data we also can see 
the same phenomenon showed by Lada Admaic, Eytan 
Ada and Liben-Nowell et ai. Fig. Q] presents the rela- 
tionship between Pr(d) and ^ we can see that they have 
a linear relationship roughly. So from the above empiri- 
cal investigations and theoretical discussion we can cer- 
tainly draw a conclusion that distribution of geographic 
distance between friendships is 

Pr(d) oc d- 1 (5) 



II. OPTIMAL COLLECTING INFORMATION 

Now we face some questions, why does this kind of dis- 
tribution exist in social networks and how does it ben- 
efit individuals? In our social networks, many human 
economical behaviors can be roughly regarded as collect- 
ing information. Making friends also can be looked as 
the way to search information. So, the social network 
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FIG. 2: The relationship among e, f, a and L. a, we shows the 
relationships among /, a and e, where L — 3600. We use color 
to indicate the value of e. b, depicts the changes of informa- 
tion entropy e with the changes of a when / = 100, 300, 700. 

c, shows the optimal a with the changes of average friends 
number /. The error bars denotes the standard deviations. 

d, shows the relationships between optimal a and the lattice 
size L. The error bars denotes the standard deviations. 



should be an optimal network which can benefit people 
for collecting information. What is the optimal a in so- 
cial network? The following model will give us a possible 
answer. 

Suppose, individuals have average finite energy w 
which can be represented by the sum of distances between 
one and his/her friends. For a node u, each time, as the 
rule of SIF, we first randomly choose a distance d accord- 
ing to Pr(d) oc d a , then randomly choose a node v for the 
node set in which the distance for u to each element is d. 
The information that node v bring to u can be denoted by 
node v and all nodes within p lattice steps to node v. Af- 
ter a proper time, the sum of all d chosen will approach w, 
then stop the execution. The information that u hunted 
can be expressed by the sequence of nodes. We use the 
entropy of the sequence to denote the value of informa- 
tion. Then we have the model: maxe = — ~^2 i=1 Pi^'npi 
subjected to Y^JLi d(u,j) = w and Pr{d) oc d a , where, 
Pi denotes the frequency of node i in the information 
sequence. For instance, if the information sequence is 
{1,1,2,3,7}, then pi = § and p 2 = P3 = p-j = f, oth- 



ers are 0. Obviously, more large the s, more information 
hunted. Here, we let p = 1 (if p is not too large we can 
get the same result). The reason is that, according to 
our common sense, if B is a friend of A, A will know 
more information about people who are always around 
B. Actually, we should take account all friends of B, but 
the time complexity will be expensive. We compare the 
two kinds of simulations in not too large networks, they 
have the similar results. 

We simulate the above model on a toroidal lattice. 
The largest distance among pair of nodes in the lat- 
tice is L = 3600. For America, from the north to 
south and from the west to east the largest distances 
are 4500fcm, and 2700fcm respectively, and the average 
is about 3600fcm (here, it is no necessary to make the 
parameter so much accuracy in the model). The average 
number of friends we contact in one year is about / = 300 
[23l ]. According to the empirical result Pr(d) oc we 
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can calculate the average w = -^—^ 



Note that here the 
empirical result of Pr(d) oc d^ 1 is only used to determine 
the parameter value of the model. It is independent of 
the optimal a. Fig. [2] shows the relationship between a 
and /. We can find that, the optimal a depends on /. 
When / is about 300, the optimal a = -0.94 ± 0.08 (± 
standard deviations). This indicates that when people 
just posses finite energy, it is a good way to keep friend- 
ships holding Pr(d) oc d^ 1 . 



III. CONCLUSION 

From the empirical results, we conclude that the dis- 
tance distribution between friendship is scale invariant. 
The distributions is about Pr(d) oc d^ 1 which is an im- 
portant and universal property for social network. It 
not only makes our social network is navigable but most 
importantly it can benefit individuals for searching infor- 
mation. 
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